• - # ^ Mof 

.Ci iF Errors Corroclod by Iho STIC sl^oms Branch / / 

Serial Number: Oy Q^T. pH"^ ^A. ' ' gL^^T ^'yf 

I 1 Changed a file from non-ASCII to ASCII " / Vorlflod by: 'r-'^ ~~ (sjic staff) 

□ Changed the margins in cases where Ihe sequence text was "wra^^^wn to the next line, j^ljj 
I 1 Edited a format error in the Cunent Application Data section, specifically: 

□ Edited the Current Application Data section with the acluaJ current number. TTie number inputted bv the 
applicant was □ Ihe prior application data; or □ other ^ 

□ Added the mandatory heading and subheadings (or -Cunrenl Application Data'. 

□ Edited the -Number of Sequences' field The applicant spelled out a number instead ol'using an integer. 
I I Changed the spelling of a mandatory fiefd (the headings or subheadings), specifically; 

□ Corrected Iho SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 



Q Inserted or correcled a nucleic number at the end of a nucleic lino. SEQ ID NO'S edited: 



□ Corrected subheading placement. All responses must be on Ihe same line as each subheadinq If tho 

applicant placed a response below the subheading, this was moved to its appropriate place. 

I 1 Inserted colons after headings/subheadings. Headings edited included: 



I 1 Deleted extra, invalid, headings ur an applicant. specificaJly: 



0^ Deleted>0^-ASCIi -gartage" at the beginning/end of files; n secretary initials/filename at end of file- 
□ page numbers throughout text; □ other invalid text, such as 

Inserted mandalory headings, specifically: 



□ Corrected an obvious erro.- in the response, specifically: 



□ Ediled identifiers where upper case is used but lower case is required, or vice versa 

□ Correcled an error in tho Number of Sequences field, specifically: 



□ A -Hard Pago Break" code was inserted by Ihe applicant. All occurrences had to be deleted. 

^ d"?^"^ v'^"?'" ^""'"^ ^"""^ sequences and adjusted the "(A) Length:' field accordingly (error 
duo to a Palentin bug). Sequences con-ecled: 



□ 



Other: 



Examiner: The above corrections must bo communicated to the applicant in the first Office 
Aclfon. DO NOT send a copy of this form. 3/1/95 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/622,74 5 

Input Set : A:\es.txt 

Output Set: N:\CRF3\06I22001\I62274 5.raw 

6 <110> APPLICANT: AstraZeneca AB 
8 <120> TITLE OF INVENTION: New methods 
10 <130> FILE REFERENCE: H 2174-1 WO 

12 <140> CURRENT APPLICATION NUMBER: US/09/622,745 

13 <141> CURRENT FILING DATE: 2000-08-22 

15 <160> NUMBER OF SEQ ID NOS : 2 

17 <170> SOFTWARE: PatentIn Ver . 2.0 



DATE: 06/12/2001 
TIME: 11:50:04 



Does Not Comply 
Corrected Diskette Needed 



ERRORED SEQUENCES 

115 <210> SEQ ID NO: 2 

116 <211> LENGTH: 4594 

117 <212> TYPE: DNA 

118 <213> ORGANISM: Homo sapiens 

120 <220> FEATURE: 

121 <221> NAME/KEY: GC_signal 

122 <222> LOCATION: Complement (( 40 80 )..( 4 0 87 ) ) 
12 4 <220> FEATURE: 

125 <221> NAME/KEY: GC_signal 

126 <222> LOCATION: Complement (( 4196 )..( 4205 ) ) 

128 <220> FEATURE: 

129 <221> NAME/KEY: GC_signal 

130 <222> LOCATION: Complement (( 4241 )..( 4249 ) ) 

132 <220> FEATURE: 

133 <221> NAME/KEY: GC_signal 

134 <222> LOCATION: Complement (( 4272 )..( 4279 ) ) 

136 <220> FEATURE: 

137 <221> NAME/KEY: misc_binding 

138 <222> LOCATION: ( 3844 )..( 3851 ) 

139 <223> OTHER INFORMATION: AP-2 

141 <220> FEATURE: 

142 <221> NAME/KEY: misc^binding 

143 <222> LOCATION: ( 430 8 ) . . ( 4 315 ) 

144 <223> OTHER INFORMATION: CRE 

146 <220> FEATURE: 

147 <221> NAME/KEY: misc_binding 

148 <222> LOCATION: ( 4375 )..( 4381 ) 

149 <223> OTHER INFORMATION: Initiator 

151 <400> SEQUENCE: 2 

152 atgttgctgc tgctgctact ggcgccactc ttcctccgcc ccccgggcgc gggcggggcg 60 

153 cagaccccca acgccacctc agaaggtgca tccttcttcg acgacctccg gccctccttc 120 
1.54 gctccacttc cctttccctg catctcctca tttctggtcc tcatcactat cccatcagtc 180 

155 ccacatatca tcccggtctg gcaacccctt ctgctcggcc cgactttact actgctgacc 240 

156 tccttctgtc accccacgtt actatccagc acctcttttc tctgcccaca ttgctacact 300 

157 ataccacctt cctgtgcatt ttctccgcct caatcccctt tcccagcccc acattactac 360 

158 ctcaattact cccttttctt ggtcccactt tgctgtccag atgatcttat tagcctccct 420 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/622,74 5 



DATE: 06/12/2001 
TIME: 11:50:04 



159 
160 
161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 
197 
198 
199 
200 
201 
202 
203 
204 
205 
206 
207 



ttatcctcct 
gctccaccca 
ccagcaccct 
tctctgtcct 
ccctcttcct 
tggggagcct 
ccttaggttg 
ctcgggacca 
gccgggggga 
ggacagatat 
caagactatt 
atccagtata 
tgctctgtgc 
tcattaggcc 
ttgaggatca 
tgaatacctc 
ttaatagaag 
atctacatat 
ttgtgtgctt 
atgtgtggat 
catgtgattt 
tctttaaagg 
agtccgaatc 
tggggacctc 
tctggtgggc 
ctgccagggt 
gggagtgggg 
acactggaaa 
tgcagtcccc 
ttcctgctcc 
tcaccttctt 
gcatcaccct 
gactcagaaa 
gaaacatatt 
actcagggat 
tattccctaa 
aattttacaa 
tgcagtcatc 
cttgggcttt 
tggaaattga 
ggttcctcac 
tgtgttcata 
gcttaaagcc 
aggcaccaca 
tgttgtccta 
ccttgttctt 
ttcagcatgg 
ggtaactctt 
tgagatgaga 



Input Set : 
Output Set: 

atcctaattc 
catatcatac 
tccctcccca 
ttagccaggg 
ctgtccccag 
gccttttcct 
ccagatcata 
ggtgaaggct 
gcgcgaggtg 
ggacacaccc 
ccttttcctg 
ccaaaattca 
ggttggaggt 
actgatcatt 
ttagtggcag 
tgtcaccaac 
ccaactataa 
ttcaaagcca 
atagccccat 
gcttgcacag 
tctttagagc 
gaaagagcct 
tgctccaagt 
ccagctctgg 
agctcccgga 
gaggggaaca 
tgggaatgga 
ggggagatga 
ggtatcctct 
cgattcttag 
caggtttaaa 
cttctagaat 
taaatcaaac 
agggtaaaag 
ggggtgggaa 
gggaatcagt 
actctccctt 
ccatttgtat 
actttcctca 
gagtcccatc 
ctacagggcc 
tggtaacagg 
aattgtatgt 
gaaccaagat 
tgaaaaatcg 
atccccaggc 
acaggtgtgg 
ctctgatccc 
aacccttacc 



A: \es . txt 

N:\CRF3\06I22001\I622745.raw 



aactggaata 
ccttcatgat 
cttgtgggtt 
gatctgtacc 
cctgcccaca 
ctttcccacc 
cacccgccct 
atcaacttcc 
gtggggccca 
agccgctgtg 
ccgcaaactt 
caaatacatt 
agtagagcta 
ggcctataac 
atgatgacaa 
ccagagacca 
taagaaaagc 
gagaaccgcc 
cttgggttcc 
gtaagggagg 
aggatgcttg 
tcttcaactc 
cttatttgac 
acggagcccg 
gcatctgtag 
gctgcctgca 
taatgggaaa 
aagtcccttt 
gtaggttggg 
acctcacgtt 
tactccaatt 
tcatccctca 
attctactaa 
acttaaaatt 
gaagacaggt 
gttgtgtctg 
ttctaggcac 
gcctcatact 
cataactcag 
caggggtgga 
ctcttcctgt 
cttttgtaaa 
gtaatttttt 
ctgccccaaa 
aagaagaaaa 
accctcccct 
gagggggctg 
cgtctttcct 
gcgcgcactg 



tcctcattta 
ttcttaatta 
ctctcatcag 
tgtccccact 
gaccacgccc 
attcctctct 
gggaaggggg 
tgccagtgga 
aggtccgcaa 
gtgagtagcc 
agcattactg 
tattgaatga 
gcagcctgca 
attgataatt 
aaaaattcta 
tatgcccaag 
aaatctgatt 
cactgtagct 
taaaatggta 
attggaagat 
tggacccaaa 
gcctctcttc 
cctggaaaat 
ggtggatttc 
tcagggccag 
tgcagctgat 
gaatggagag 
ttcctccatc 
ggcttccttc 
ttctcttttc 
ttccctttct 
ccattcctta 
gaaaaattga 
ggaggcagca 
ccttttctgk 
tctacytttt 
ccgaactctc 
tcctctaccc 
ttattctgct 
cttatgacac 
gctctaataa 
aattgcagaa 
ttcttaaaga 
cttagctatt 
taagtcctga 
cagaaacgca 
gggatcaggc 
gctgccagtg 
caatgccctc 



gccttttttt 
cttttctttc 
ctttaaccct 
cccaccctct 
tactctcccc 
gtatgcctcc 
catcaggtac 
ctatgagatt 
gtgcctggcc 
tcggaagccc 
cttgcaagtc 
ctactacata 
cagttcattt 
catcttgtca 
aaatgatttc 
aaacaaaagc 
gtgcatccaa 
gactttgaag 
attttttttt 
aggtaggcaa 
cctgcacctg 
ttattttcct 
gggaaggttt 
cggtgtgacc 
tggagcaccc 
gaggacgctt 
ctataaaaat 
acctgcctca 
ctttaccttt 
ctttatgaat 
ctaaacttag 
tataattgat 
gaaggggagc 
ttatcagaag 
acttcctaga 
tttttttttt 
tgccatcttc 
tggtagattc 
tctagtttac 
tactgaaact 
tatagagggc 
ataagatttt 
ctcccaattt 
ggcattcccg 
cccccttacc 
ggcttctgct 
cagggaagct 
aatcgaacgc 
cccttcactc 



ttaaagaaaa 
ttacctccac 
ggccctttac 
agtgccccat 
ttcctcccac 
ccgactcacc 
cggggcctga 
gagtatgtgt 
aacggctcct 
ctcccctctt 
agcactttaa 
agagcaattt 
catcctccct 
gttattctct 
atcacatttt 
cagtttaata 
agttatatac 
agatcccatt 
tcttttggga 
atccttttca 
agtcccctgc 
atctctccac 
tcctgacggg 
ccgacttcca 
ccaagcccca 
gtgtgaggat 
gtgggggagg 
aacttcctct 
taaaaaaatc 
ctcacctctc 
aaatttccat 
ttattgtaaa 
tctgggggtg 
atgaagaaca 
caacctccat 
tttgccacgt 
tctcctggga 
tttcaagatc 
cattttattc 
tagacttcaa 
tcgatggata 
aacagcaatt 
tgtaatattc 
tctcaaattc 
cccagaccca 
ctccccggtc 
gggcgccagt 
cacactcagg 
tgcaccctcc 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/622,745 



DATE: 06/12/2001 
TIME: 11:50:04 



Input Set : A:\es.txt 

Output Set: N:\CRr3\06122001\I62274 5.raw 



208 acccccctga aattctgccc ttaggctacg gggcgtcgtc ctttcgcacc ttccccaacc 3420 

209 caccccagtt tgcggccacc cccttccctc cctacctgtt tcctgcctcc agtcccggtt 3480 

210 ttccacgagg ctgcggtctc tccttgtccc tgcttggcta cacttccctg ggctccacct 3540 

211 cctcccagac tgagcctcgc cggtgtcagg cagagcccag cagarggcgg cagggtgctg 3600 

212 ggagaccctg agctcccacc acgttttccc ctgtggggtt ccttgcgacc ttcgctggaa 3660 

213 ccttttccag cctgctgcct cctaggattt cacctaatgg actttctcag cctgtcccac 3720 

214 ccatcccaac cctggccagg cctctcgcgc tcttccccac atcttttcct tccgtgtacc 3780 

215 ccttccctcg tcttttctca attccatgtc ctgtctccct ttcttaggct tctgtctacc 3840 

216 cagccccagg ctcccttcca cgaccccacc actccctcaa accagcctcc cttccgtacc 3900 

217 caactcgttc cctccaaaac cgtttcctct cccccacatc ctcagtgctt cactgtatcg 3960 

218 actcatactc ccacttcaga cctcaggcgc cagccccgtt tctctcccgt cccactcgca 4020 

219 tccttccctt cctaccctgg ttcctccgtg cttcagcctc ccgcggctcc ctccgcccac 4080 

220 cccgccctcc tggcacgccc cgtccccatt tctcctcccc tcgggtcccc ttaagtgaga 4140 

221 tccctccctt cctctttcgt tcctttcctc ctcgaggttg catcccccct cccctccccg 4200 

222 cccctccgac tgtcgctccc acctcggcgc tcgcttccct ccccgccccc ttcctgcctc 4260 

223 cccagctccc gcccgccccc ccaccccccg ctgccgcgcg ccgcccgtga cgtcagagcc 4320 

224 ccctcccagc cccacatctc cctcctgctc ctcctcctcc cctccgtcgg tcagtcagtc 4 3 80 

225 cgcgaggaga gtccgcggtg gcggcgacgg tggcgagagc cgcgggggcc gtaggaagcc 4440 

226 aaccttccct gcttctccgg ggccctcgcc ccctcctccc cacaaaatca gggatggagg 4500 

227 cgcctccccg gcaccctctt agcagccctc cccgggaaaa gtgtcccccc tgagctccta 4560 

228 acgctcccca acagctaccc ctgcccccca cgcc 4594 
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VERIFICATION SUMMARY DATE: 06/12/2001 

PATENT APPLICATION: US/09/622,74 5 TIME J 11:50:05 

Input Set : A:\es.txt 

Output Set: N:\CRF3\0612200I\I622745.raw 

L:12 M:270 C: Current Application Number differs. Replaced Application Number 
L:13 M:271 C: Current Filing Date differs. Replaced Current Filing Date 
L:233 M:254 E: No. of Bases conflict, LENGTH : Input : 4 Counted:4594 SEQ:2 
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Namnlos 

s . 

CC Where differences are found these are annotated as variations 
CC together with a note of the overlapping clone name. Note that th 
e 

CC variation annotation may not be found in the sequence submission 

CC corresponding to the overlapping clone, as we submit sequences 

CC with only a small overlap as described above . 

CC This sequence is the entire insert of clone 271M21. 

CC This sequence has been finished according to sequence map criter 

ia as 

CC follows. An attempt is made to resolve all sequencing problems, 
such 

CC as compressions and repeats, but not necessarily within known 
CC annotated human- repeat sequence elements (e.g. Alu) . Where the 
CC sequence is ambiguous, there is an ar^inotation using the "unsure" 
CC feature key. 

CC This sequence was generated from part of bacterial clone contias 
of 

CC human chromosome 6, constructed in collaboration by the Sanger C 
entre 

CC chromosome 6 mapping group and Armin Volz & Andreas Ziegler. Fur 
ther 

CC information can be found at http://www.sanger.ac.uk/HGP/Chr6/ 
CC 271M21 is from the library RPCIl constructed at the Roswell Park 



SCORES Initl: 6935 Initn: 15224 Opt: 19389 z-score: 3034.4 E( 

) : 0 

99.7% identity in 3909 bp overlap 

10 20 

30 

Pagapr 1 . Dna GATCATATTAATTTGAAGGTGGCGGG 
GCAG 

I I I I I I I I I M I I I I I I I I I I I I I I I 

I I I I 

Hs 2 7 lm2 1 CCCAAGGAAGGAGTCAAGACTGAGAACAACGATCATATTAATTTGAAGGTGGCGGG 
GCAG 

1580 1590 1600 1610 1620 1630 

40 50 60 70 80 

90 

Pagapr 1 . Dna GATGGTTCTGTGGTGCAGTTTAAGATTAAGAGGCATACACCACTTAGTAAACTAAT 
GAAA 



Namnlos 

I I I I I I I M I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I M I I I I I I I I I I I I I 

MM 

Hs271m21 GATGGTTCTGTGGTGCAGTTTAAGATTAAGAGGCATACACCACTTAGTAAACTAAT 
GAAA 

1640 1650 1660 1670 1680 1690 



100 110 120 130 140 

150 

Pagaprl . Dna GCCTATTGTGAACGACAGGGATTGTCAATGAGGCAGATCAGATTCCGATTCGACGG 
GCAA 

MM 



M M M M M M M M M M I M M M M M M M M M I M M M M M M I M I 



Hs271m21 GCCTATTGTGAACGACAGGGATTGTCAATGAGGCAGATCAGATTCCGATTCGACGG 
GCAA 

1700 1710 1720 1730 1740 1750 



160 170 180 190 200 

210 

Pagapr 1 . Dna CCAATGAAACAGACACACCTGCACAGTTGGAAATGGAGGATGAAGATACAATTGAT 



GTGT 



M M M I M M M M M M I M I I M M M I M I M I M I M M M M M M M I 



Hs2 7 lm2 1 CCAATGAAACAGACACACCTGCACAGTTGGAAATGGAGGATGAAGATACAATTGAT 
GTGT 

1760 1770 1780 1790 1800 1810 



220 230 240 250 260 

270 

Pagapr 1 . Dna TCCAACAGCAGACGGGAGGTGTCTACTGAAAAGGGAACCTGCTTCTTTACTCCAGA 
ACTC 

M M I M M I M I M M M M M I M M M M M M M M M I M I M M M M M 

MM 

Hs2 71m21 TCCAACAGCAGACGGGAGGTGTCTACTGAAAAGGGAACCTGCTTCTTTACTCCAGA 
ACTC 

1820 1830 1840 1850 1860 1870 



280 290 300 310 320 

330 

Pagaprl . Dna TGTTCTTTAAAGACCAAGATTACATTCTCAATTAGAAAACTGCAATTTGCTTCCAC 
CACA 

M M I M M M M M I M M I M M M I M M M I M M I M I M M M M M I M 

MM 

Hs2 71m21 TGTTCTTTAAAGACCAAGATTACATTCTCAATTAGAAAACTGCAATTTGCTTCCAC 

3 



CACA 



1880 



Nartinlos 
1890 1900 



1910 



1920 



1930 



390 



340 



350 



360 



370 



380 



Pagaprl .Dna TCCTGACTACTACCGTATAGTTTTCTCTATTCTTTCATTTCCCCCTTCCCCATTCC 
TTTA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 TCCTGACTACTACCGTATAGTTTTCTCTATTCTTTCATTTCCCCCTTCCCCATTCC 
TTTA 

1940 1950 1960 1970 1980 1990 



450 



400 



410 



420 



430 



440 



Pagaprl .Dna CTGTACATAAAGTAACTGGTATATGTGCACAAGCATATTACTTTTTTTTTTTAAAA 
CTAA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I M I I I I M I I I I I I I I I I I I I 



Hs271m21 
CTAA 



CTGTACATAAAGTAACTGGTATATGTGCACAAGCATATTACTTTTTTTTTTTAAAA 
2000 2010 2020 2030 2040 2050 



509 



460 



470 



480 



490 



500 



Pagaprl . Dna ACAGCCAATGGTATGTTTTGATTGACATCAAGTGGAGACGGGG-GGGAAAATACTG 
ATTC 

I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M II I II II 

I II I 

Hs27 lm2 1 ACAGCCAATGGTATGTTTTGATTGACATCAAGTGGAGACGGGGCGGAAAAATACTG 
ATTC 

2060 2070 2080 2090 2100 2110 



510 



520 



530 



540 



550 



560 



569 



Pagapr 1 . Dna TGTGAAAATACCCCCTTTCTCCATTAGTGGCATGCTCATTCAGCTCTTATCTTTAT 
ATTC 

I M I I M I I I I I II I I M I II II I II I I I I I I II I I M I I I II I I I I I I I I I I I II 

MM 

Hs2 7 lm2 1 TGTGAAAATACCCCCTTTCTCCATTAGTGGCATGCTCATTCAGCTCTTATCTTTAT 
ATTC 



2120 



2130 



2140 



2150 



2160 



2170 



Namnlos 



570 580 590 600 610 620 

629 

Pagaprl . Dna CAGTAAGTTATTTTGCTCTCACTGTTTTAACAACAACAACAAAAAAACAACAACAT 
AAAA 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs2 7 lm2 1 CAGTAAGTTATTTTGCTCTCACTGTTTTAACAACAACAACAAAAAAACAACAACAT 
AAAA 

• 2180 2190 2200 2210 2220 2230 



630 640 650 660 670 680 

689 

Pagaprl . Dna ATCCTTGCATACCTTGTTCAATTGGAGAATTTTAATGTTTTTCATTTATCATTGTA 
AAAC 

I I I I I M I I I I I I I I j I I I I I I I i I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I 

I I M 

Hs2 71m21 ATCCTTGCATACCTTGTTCAATTGGAGAATTTTAATGTTTTTCATTTATCATTGTA 
AAAC 

2240 2250 2260 2270 .2280 2290 



690 700 710 720 730 740 

749 

Pagaprl . Dna CAAGGACAATTTTATAACTTTTTTGTACTTAGCTGTTACATGCAGAGCAATCTGTC 
TTTA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I 

I I I I 

Hs271in21 CAAGGACAATTTTATAACTTTTTTGTACTTAGCTGTTACATGCAGAGCAATCTGTC 
TTTA 

2300 2310 2320 2330 2340 - 2350 

750 760 770 780 790 800 

809 

Pagaprl . Dna AGTAGGGATAAATTACTCTAAAACAAAAAAGAATCCTAGATAGTTTTCCCTTCAAG 
TCAA 

M I I I i I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I M 

Mil 

Hs2 7 lm2 1 AGTAGGGATAAATTACTCTAAAACAAAAAAGAATCCTAGATAGTTTTCCCTTCAAG 
TCAA 

2360 2370 2380 2390 2400 2410 



810 820 830 840 850 860 

5 



Namnlos 

Pagapr 1 . Dna GCGTCTTGTTGTTTAAATAAACTTCTTGTTT-AAAAAAAAAAAAAGTAAAAAAGAA 
AAGT 

M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I i I I I I I I I I I I i I 

I I I I 

Hs2 7 lm2 1 GCGTCTTGTTGTTTAAATAAACTTCTTGTTTAAAAAAAAAAAAZU^GTAAAAAAGAA 
AAGT 

2420 2430 2440 2450 2460 2470 

870 880 890 900 910 920 



Pagapr 1 . Dna TATGCAACAATTAATGGCCCAGAGGCAATCCTTGTTAACATTTTGATGCATCTTTT 
AGCT 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs2 71m21 TATGCAACAATTAATGGCCCAGAGGCAATCCTTGTTAACATTTTGATGCATCTTTT 
AGCT 

2480 2490 2500 2510 2520 2530 



930 940 950 950 970 980 

Pagapr 1 .Dna G-TTTTTTTTTTTTTTTTTTTTTTTGACTGAGTTTGACTCTTGTCACCCAGGCTGA 
AGTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I M I I I I I 

I I I I 

Hs2 71m21 GTTTTTTTTTTTTTTTTTTTTTTTTGACTGAGTTTGACTCTTGTCACCCAGGCTGA 
AGTG 

2540 2550 2560 2570 2580 2590 



990 1000 1010 1020 1030 1040 

Pagaprl . Dna CAATGGCATGGCATGATCTTGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGTGA 
TTCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

I I I I 

Hs271m21 CAATGGCATGGCATGATCTTGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGTGA 
TTCT 

2600 2610 2620 2630 2640 2650 



1050 1060 1070 1080 1090 1100 

Pagapr 1 . Dna CCTGCCTCAGCCTCCTGAGTAGCTAGGATTACGGGCATGCACCACCATGCCTGGCT 
AATT 

6 



Naranlos 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs2 71m21 CCTGCCTCAGCCTCCTGAGTAGCTAGGATTACGGGCATGCACCACCATGCCTGGCT 
7VATT 

2660 2670 2680 2690 2700 2710 

1110 1120 1130 1140 1150 1160 

Pagaprl . Dna TTGTATTTTTAGTAGAGTTGGGGCTTCTCCACACTGGTCAGGCTGGTCTCGAACTC 
CCAA 

I I I M I I I I I I I I I I I I I i i I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

I I I I 

Hs2 71m21 TTGTATTTTTAGTAGAGTTGGGGCTTCTCCACACTGGTCAGGCTGGTCTCGAACTC 
CCAA 

2720 2730 _ 2740 2750 2760 2770 

1170 1180 1190 1200 1210 1220 

Pagaprl . Dna CCTCAGGTGATAAGGGAAGGGGCACTATTGACATTTATGGTTGGGGCAGAGGTGTA 
AGAT 

I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I M I [ I I I I I I I I I I I I I I I I M I 

I I I I 

Hs271in21 CCTCAGGTGATAAGGGAAGGGGCACTATTGACATTTATGGTTGGGGCAGAGGTGTA 
AGAT 

2780 2790 2800 2810 2820 2830 

1230 1240 1250 1260 1270 1280 

Pagaprl . Dna ATTCTTCAAAGCACTACCTACATGTTGAAGAATTGTTCCTCACCCAGATTCTC7\AA 
AGTC 

I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I 

I I I I 

Hs271m21 ATTCTTCAAAGCACTACCTACATGTTGAAGAATTGTTCCTCACCCAGATTCTCAAA 
AGTC 

2840 2850 2860 2870 2880 2890 

1290 1300 1310 1320 1330 1340 

Pagaprl . Dna CCCCAGGACATTCACGTAGTGAAAACCTGTGTTTAATTATCTGAGCCTATAACTTA 
ATAC 



I I 



M I I I I I I I N I I I I I I I I I I I I I M I I I III I I M I I M I I I I I I I I I I I I I I I 



Namnlos 

Hs271m21 CCCCAGGACATTCACGTAGTGAAAACCTGTGTTTAATTATCTGAGCCTATTVACTTA 
ATAC 

2900 2910 2920 2930 2940 2950 



1350 1360 1370 1380 1390 1400 

Pagaprl . Dna AGTTTTAAAATTTTTTTTTAAATATACAGTGAACTTTCTAGGAATGCAATTATAGT 
TGTG 

I I I I I ! M I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs2 71m21 AGTTTTAAAATTTTTTTTTAAATATACAGTGAACTTTCTAGGAATGCAATTATAGT 
TGTG 

2960 2970 2980 2990 3000 3010 



1410 1420 1430 1440 1450 1460 



Pagaprl . Dna TGTAAAATTAGGGAAAATTAACTTTGCTACCAAGAGTTGTTCAACATTTTGTTAAA 
TCAC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs 2 7 lm2 1 TGTAAAATTAGGGAAAATTAACTTTGCTACCAAGAGTTGTTCAACATTTTGTTAAA 
TCAC 

3020 3030 3040 3050 3060 3070 

1470 1480 1490 1500 1510 1520 

Pagaprl . Dna TTCATTGATGGCAACATGCTGGAGGTAGTTGAGTCACCAACTCAGCACCTGGATCA 
GCCT 

M I M I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I.I I I I I I I I I I I 

Mil 

Hs271m21 TTCATTGATGGCAACATGCTGGAGGTAGTTGAGTCACCAACTCAGCACCTGGATCA 
GCCT 

3080 3090 3100 3110 3120 3130 



1530 1540 1550 1560 1570 1580 

Pagaprl . Dna GTGTTGGTAGCAGTTTCATCCCCGTGGTTCTGTGAATAGGTGGAAGCATCTGCTTA 
CTCC 

I M I I I I I M I M M II I I I I I I I I II I I M I II M I II M I I I I I M I II I I I M 

I I I I 

Hs2 71m21 GTGTTGGTAGCAGTTTCATCCCCGTGGTTCTGTGAATAGGTGGAAGCATCTGCTTA 
CTCC 

3140 3150 3160 3170 3180 3190 

8 



Namnlos 



1590 1600 1610 1620 1630 1640 

Pagaprl . Dna ATCAGGACTTCTAGGGTAGTCGGGCCTTGGCACTCACACATTAAAATACTGTTTAT 
GTTA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 ATCAGGACTTCTAGGGTAGTCGGGCCTTGGCACTCACACATTAAAATACTGTTTAT 
GTTA 

3200 3210 3220 3230 3240 3250 



1650 1660 1670 1680 ,1690 1700 

TTTTATTGCAAGTTACTTTTCTTTCATTTCCCCTTTACGTTACAGAAAGGGAAGCA 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TTTTATTGCAAGTTACTTTTCTTTCATTTCCCCTTTACGTTACAGAAAGGGAAGCA 
3260 3270 3280 3290 3300 3310 



1710 1720 1730 1740 1750 1760 

Pagaprl . Dna GCTTTCTGTTTAAAGTTGTGTATGTAGGTAGGTTATATCATCTAWGACTTTCTCTC 
CCTC 

I I I I I I M I I i I i I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I : I I I I I |.| I I M 

INI 

Hs2 71m21 GCTTTCTGTTTAAAGTTGTGTATGTAGGTAGGTTATATCATCTATGACTTTCTCTC 
CCTC 

3320 3330 3340 3350 3360 3370 

1770 1780 1790 1800 1810 1820 

Pagaprl . Dna CTTCCCTTTCTTTTTGTTTGAGATGGAGTCTTGCTCTGTCACCCAGGCTGGAGTGC 
AGTG 



Hs2 71ra21 CTTCCCTTTCTTTTTGTTTGAGATGGAGTCTTGCTCTGTCACCCAGGCTGGAGTGC 
AGTG 

3380 3390 3400 3410 3420 3430 



1830 1840 1850 1860 1870 1880 

9 



Pagaprl . Dna 
TTTT 



Hs271m21 
TTTT 



Pagapr 1 . Dna 
TCAG ■ 



Namnlos 

GTGCGATCTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGCGATTCTGGTGTC 



I I I I 

Hs2 71m21 GTGCGATCTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGCGATTCTGGTGTC 
TCAG 

3440 3450 3460 3470 3480 3490 



1890 1900 1910 1920 1930 1940 

Pagapr 1 . Dna CTGGGATTACAGGCGCACACCATCACACCACGCTAATTTTTCTATTTTTAGTAGAG 
ATGG 



Hs2 71m21 CTGGGATTACAGGCGCACACCATCACACCACGCTAATTTTTCTATTTTTAGTAGAG 
ATGG 

3500 3510 3520 3530 3540 ' 3550 



1950 1960 1970 1980 1990 2000 



GGTTTCGCCATGCTGGCCAGGCCAGGCTGGTCTCAAACTCCTGAGCTCAAGTGATC 
M I I I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I M I I 
GGTTTCGCCATGCTGGCCAGGCCAGGCTGGTCTCAAACTCCTGAGCTCAAGTGATC 
3560 3570 3580 3590 3600 3610 



2010 2020 2030 2040 2050 2060 

Pagapr 1 . Dna CGCCTCGGCCTCCCAAAGTTCTGGGATTTCAGGCGTGAGCCTCATCTATGAATCTC 
AATT 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 

I I I I 

Hs271m21 CGCCTCGGCCTCCCAAAGTTCTGGGATTTCAGGCGTGAGCCTCATCTATGAATCTC 
AATT 

3620 3630 3640 3650 3660 3670 



2070 2080 2090 2100 2110 . 2120 

Pagaprl , Dna TAGGACAGTAAAAGTGTCATTACAAAAATATTTATTGTAAAAAAGGGTTGGAGGTT 

10 



Pagaprl , Dna 
AGTC 

I I I I 

Hs271m21 
AGTC 



Namnlos 

GAGA 



Hs271m21 TAGGACAGTAAAAGTGTCATTAC-AAAATATTTATTGTAAAAAAGGGTTGGAGGTT 
GAGA 

3680 3690 3700 3710 3720 373 

0 

2130 2140 2150 2160 2170 2180 

Pagapr 1 . Dna ATCTCAATTCTAGTCAGTCTCTCAGTGTTTGGTTTCTTCCTACCATTTTTCCCCCT 
AGGA 



Hs271ni21 ATCTCAATTCTAGTCAGTCTCTCAGTGTTTGGTTTCTTCCTACCATTTTTCCCCCT 
AGGA 

3740 3750 3760 ' 3770 3780 379 

0 

2190 2200 2210 2220 2230 2240 

Pagaprl . Dna CCAGCCAGAAAGCAGCTTTTTTTTTGTCCCCCCCAACAAGGAGCCCACTGTTTCCT 
CTCC 



Hs2 71m21 CCAGCCAGAAAGCAGCTTTTTTTTTGTCCCCCCCAACAAGGAGCCCACTGTTTCCT 
CTCC 

3800 3810 3820 3830 3840 385 

0 

2250 2260 2270 2280 2290 2300 

Pagaprl . Dna CAGCCCA7\ACTCAGGCCTACGAACAACAACAGCAC — TACACACACACACACACAC 
ACAC 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

MM 

Hs271m21 CAGCCCAAACTCAGGCCTACGAACAACAACAGCACAACACACACACACACACACAC 
ACAC 

3860 3870 3880 3890 3900 391 

0 

2310 2320 2330* 2340 2350 2360 

Pagaprl . Dna ACACACACACACACACACCCCTCCACTTCAAGGTATAGCCAAGAGCTTCTGGAGCC 
GTCA 



11 



Namnlos 

Hs271in21 ACACACACACACACACACCCCTCCACTTGAAGGTATAGCCAAGAGCTTCTGGAGCC 
GTCA 

3920 3930 3940 3950 3960 397 

0 

2370 2380 2390 2400 2410 2420 

Pagaprl . Dna AAAAGGTCTGTACCTGCTGTCTTTAGAGCTTCCAGTTTGCCCTTGGTCAAGAAATA 
CTGT 

I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I M I I I M I I 

I I I I 

Hs271m21 AAAAGGTCTGTACCTGCTGTCTTTAGAGCTTCCAGTTTGCCCTTGGTCAAGAAATA 
CTGT 

3980 3990 4000 4010 4020 403 

0 

2430 2440 2450 2460 2470 2480 

Pagaprl . Dna TTGCTAGGCTCTGCTGGAGTACATCAGGTAATACTGGCTTCTAAACCACCCTGAGG 
TTCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I M I I I I I I I I I I I I I I I M I i I I I 

I I I I 

Hs271m21 TTGCTAGGCTCTGCTGGAGTACATCAGGTAATACTGGCTTCTAAACCACCCTGAGG 
TTCT 

4040 4050 4060 4070 4080 409 

0 

2490 2500 2510 2520 2530 2540 

Pagaprl . Dna TTTCTCTTGTCCTTTTACTCCCTTCGTACTTCAATTTCTCTCCTTGATGTCCCCCT 
CCCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I [ I I I I I M 

1 1 1 1 

Hs2 71m21 TTTCTCTTGTCCTTTTACTCCCTTCGTACTTCAATTTCTCTCCTTGATGTCCCCCT 
CCCT 

4100 4110 4120 4130 4140 415 

0 

2550 2560 2570 2580 2590 2600 

Pagaprl . Dna GTTTTGTTTTTTGCCTCCAATCCGTTCTGCGCGTTCCCTGCAGAGCAGGCGAGTAG 
CAAT 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 

I I I I 

Hs271m21 GTTTTGTTTTTTGCCTCCAATCCGTTCTGCGCGTTCCCTGCAGAGCAGGCGAGTAG 
CAAT 

4160 4170 4180 4190 4200 421 

12 



0 



Namnlos 



2610 2620 2630 2640 2650 2660 

Pagaprl . Dna GCTGCTGGACCATGGAGCTGCTCTAGTCTCCCAGAAATCTCTTCTACACCCAACCC 
TTCT 

I I I I M I I I I I I I I I I I I i I I I I M I I I I I I I I I I M I I I I I I I I I M I I M I I I I 

I I I I 

Hs271m21 GCTGCTGGACCATGGAGCTGCTCTAGTCTCCCAGAAATCTCTTCTACACCCAACCC 
TTCT 

4220 4230 4240 4250 4260 427 

0 

2670 2680 2690 2700 2710 2720 

Pagaprl . Dna TGCGCTTAGGTGGTCCTCAGTCCCCCTCCCCCACCTCCTTCTGACCCAGGCTTCTT 
TCTC 

I I I I I M I I I I M I I I I I M I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

1 1 1 1 

Hs271iti21 TGCGCTTAGGTGGTCCTCAGTCCCCCTCCCCCACCTCCTTCTGACCCAGGCTTCTT 
TCTC 

4280 4290 4300 4310 4320 433 



2730 2740 2750 2760 2770 2780 

Pagaprl . Dna GCCCTCCGGTCGCAGTTCTCCTGGGCATCTGCCTCTGCCTCTCTCCTCTCACCCGG 
ATCT 

I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I M I I I M I I I I I I I I I M M 

MM 

Hs271m21 GCCCTCCGGTCGCAGTTCTCCTGGGCATCTGCCTCTGCCTCTCTCCTCTCACCCGG 
ATCT 

4340 4350 4360 4370 4380 439 



2790 2800 2810 2820 2830 2840 

Pagaprl . Dna AGGGCTGCCTTCTCTTTGTGCAGCCGTCTTTCTCCACCTTCATCCCAGACTCCCTG 
TCTC 

M M M M M M I M I M M M M M i M M I M M M M M M M M M I M I M 

MM 

Hs271m21 AGGGCTGCCTTCTCTTTGTGCAGCCGTCTTTCTCCACCTTCATCCCAGACTCCCTG 
TCTC 

4400 4410 4420 4430 4440 445 

0 

13 



Namnlos 

2850 2860 2870 2880 2890 2900 

Pagaprl . Dna AGCGCCAGCTCCTCTGCCTTTGGCTCGGGTTCCCTCTCCCCCACCCCAGCTTCCAG 
TTGT 

I i I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I i I .I I I 

MM 

Hs271m21 AGCGCCAGCTCCTCTGCCTTTGGCTCGGGTTCCCTCTCCCCCACCCCAGCTTCCAG 
TTGT 

4460 4470 4480 4490 4500 451 



2910 2920 2930 2940 2950 2960 

Pagaprl . Dna TTGGCCCGCAGGTCCCTCGGCAGTGACCGGCGCCCCCCGACGAGTGCGTGTGCACC 
AGGG . 

II I II II I I II I II II M I I II II I I I I II I II M I M I II I I I II I II II M I I I 

MM 

Hs2 71m21 TTGGCCCGCAGGTCCCTCGGCAGTGACCGGCGCCCCCCGACGAGTGCGTGTGCACC 
AGGG 

4520 4530 4540 4550 4560 457 

0 

2970 2980 2990 3000 3010 3020 

Pagapr 1 . Dna CACCTCCCTCTCCCCCACCTCTCAGCCCCGCGCCTCTCCACCGCCCGCCCCACCGC 
GCTG 

II I I M I I II I I II II I II II I I I I II I I I II II I II I II I I II II I II II II I M 

I II I 

Hs2 7 11X121 CACCTCCCTCTCCCCCACCTCTCAGCCCCGCGCCTCTCCACCGCCCGCCCCACCGC 
GCTG 

4580 4590 4600 4610 4620 463 

0 

3030 3040 3050 3060 3070 3080 

Pagapr 1 . Dna TGGGCGGTCCAGGGCGGGGCTGGGATCCGGGGCGGCTCCCGGGGCTCGGGTTGTGG 
GAGG 

i M I II I I I M I I II II II I II I II I M II I M I II M I II II I I II I M II II I I 

MM 

Hs2 71in21 TGGGCGGTCCAGGGCGGGGCTGGGATCCGGGGCGGCTCCCGGGGCTCGGGTTGTGG 
GAGG 

4640 4650 4660 4670 4680 469 

0 

3090 3100 3110 3120 3130 3140 

Pagaprl . Dna CGCCCTCTCCCCGGTCTTCCCCTCTCTTCCCCCCGCCCTGCCTTCCCTTGCACCCT 

14 



Namnlos 

CCTT 

1 1 1 1 1 1 1 1 1 n I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I 

I I I I 

Hs271m21 CGCGCTCTCCCCGGTCTTCCCCTCTCTTCCCCCCGCCCTGCCTTCCCTTGCACCCT 
CCTT 

4700 4710 4720 4730 4740 475 



3150 3160 3170 3180 3190 3200 

Pagaprl . Dna CTTCCCTCCGCCCGGGAGCTCTCCCTGGTCCCCGGCGCCGCCTCCTTCCCTCCCGG 
CTCC 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M i I I I I I i I I I I M I I I 

I I I I 

Hs271m21 CTTCCCTCCGCCCGGGAGCTCTCCCTGGTCCCCGGCGCCGCCTCCTTCCCTCCCGG 
CTCC 

4760 4770 4780 4790 4800 481 

0 

3210 3220 3230 3240 3250 3260 

Pagaprl . Dna CCGCTCCCCGCTCCCGTGGCTGCCGCCGCCCCGGGGAAGAAGAGACAGGGGTGGGG 
TTTG 

I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I M i I I I I I I I I I I I I 

I I I I 

Hs2 7 lm2 1 CCGCTCCCCGCTCCCGTGGCTGCCGCCGCCCCGGGGAAGAAGAGACAGGGGTGGGG 
TTTG 

4820 4830 4840 4850 4860 487 



3270 3280 3290 3300 3310 3320 

Pagaprl . Dna GGGGAAGCGAGAGAGGAGGGGAGAGACCCTGGCCAGGCTGGAGCCTGGATTCGAGG 
GGAG 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I 

I I I I 

Hs27 lm2 1 GGGGAAGCGAGAGAGGAGGGGAGAGACCCTGGCCAGGCTGGAGCCTGGATTCGAGG 
GGAG 

4880 4890 4900 4910 4920 493 

0 

3330 3340 3350 3360 3370 3380 

Pagapr 1 . Dna GAGGGACGGGAGGAGGAGAAAGGTGGAGGAGAAGGGAGGGGGGAGCGGGGAGGAGC 
GGCC 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I 

I I I I 

15 



Hs271m21 
GGCC 



Namnlos 

GAGGGACGGGAGGAGGAGAAAGGTGGAGGAGAA.GGGAGGGGGGAGCGGGGAGGAGC 
4940 4950 4960 4970 4980 499 



3390 3400 3410 3420 3430 3440 

Pagapr 1 . Dna GGGCCTGGGGCCTTGAGGCCCGGGGAGAGCCGGGGAGCCGGGCCCGCGCGCCGAGG 
TAAG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

I I I I 

Hs271m21 GGGCCTGGGGCCTTGAGGCCCGGGGAGAGCCGGGGAGCCGGGCCCGCGCGCCGAGG 
TAAG 

5000 5010 5020 5030 5040 505 



3450 3460 3470 3480 3490 3500 

Pagapr 1 . Dna AGCCAGGGCCCCGGGTTAGCAGGGCTCGGAGAGGGGGCGCGCGGCGTGGTGGGGGA 
GGGG 

I I I I I I I I I I I I I I I I I I I M I I I M I I I I i I I I I I I I I I I I I I I I I I I I I I I I M 

MM 

Hs 2 7 lm2 1 AGCCAGGGCCCCGGGTTAGCAGGGCTCGGAGAGGGGGCGCGCGGCGTGGTGGGGGA 
GGGG 

5060 5070 5080 5090 5100 511 

0 

3510 3520 3530 3540 3550 3560 

Pagapr 1 . Dna GCAGTGGGCGCAGGGCCCAGCTGGGGGAAGCGGGGCTGGGGGAGAGGAGGAACCGC 
GGGG 

I I M M M M M M M M M I M M I M M M M M M M M M I I M M M M M 

MM 

Hs2 71m21 GCAGTGGGCGCAGGGCCCAGCTGGGGGAAGCGGGGCTGGGGGAGAGGAGGAACCGC 
GGGG 

5120 5130 5140 5150 5160 517 

0 

3570 3580 3590 3600 3610 3620 

Pagapr 1 . Dna ATGGAATCGGGGAGCGCTGAGGCGGCCGATGCCGGGAGCGTGGGTAAGCCAGGCTT 
CTGC 

M I M M M M I M M I M i M M M M M M M M I M M M M M M M I M M 

I M I 

Hs271m21 ATGGAATCGGGGAGCGCTGAGGCGGCCGATGCCGGGAGCGTGGGTAAGCCAGGCTT 
CTGC 

16 



Namnlos 

5180 5190 5200 5210 5220 523 

0 

3630 3640 3650 3660 3670 3680 

Pagaprl . Dna GAGCCGCGGGGGCCGGGGGAGAGGAGGTGGTGAGAGGTGGAGT-CCGGGAGGGTTG 
GGGG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I M I I I I I I I I I I I I I I I 

I I I I 

Hs 2 7 lm2 1 GAGCCGCGGGGGCCGGGGGAGAGGAGGTGGTGAGAGGTGGAGTCCCGGGAGGGTTG 
GGGG 

5240 5250 5260 5270 5280 529 



3690 3700 3710 3720 3730 3740 

Pagaprl . Dna CCGAGGGAGGCAGGAGGAGGGTGGGGACAGGCTTTCTCTCCTCCTCTCCCCCCACC 
CCGC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 CCGAGGGAGGCAGGAGGAGGGTGGGGACAGGCTTTCTCTCCTCCTCTCCCCCCACC 
CCGC 

5300 5310 5320 5330 5340 535 

0 

3750 3760 3770 3780 3790 3800 

Pagaprl . Dna GCGGGGCTCCGCCCCCGCCTCCTCCGCGGGGCGCTCTCTTGGTCCCCAGGCTGAGC 
CCGG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I I I I I I i i I 

I I I I 

Hs271m21 GCGGGGCTCCGCCCCCGCCTCCTCCGCGGGGCGCTCTCTTGGTCCCCAGGCTGAGC 
CCGG 

5360 5370 5380 5390 5400 541 

0 

3810 3820 3830 3840 3850 3860 

Pagaprl . Dna TCGGAGCCTGCGAGGCAACCGGCAAGAGGTCGAGTAGTCTCCGGGTGCGGGCCGCG 
CCGG 

I I I I I I I I I I I Ml I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I 11 I I M I I I I 

I I I I 

Hs 2 7 lm2 1 TCGGAGCCTGCGAGGCAACCGGCAAGAGGTCGAGTAGTCTCCGGGTGCGGGCCGCG 
CCGG 

5420 5430 5440 5450 5460 547 



17 



Naninlos 

3870 3880 3890 3900 

Pagaprl.Dna CGGGGCTCGGTCCAGTCCTCATGGCCGCCTCTCACTTAG 
I I I I I I I I M I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I 

Hs271m21 CGGGGCTCGGTCCAGTCCTCATGGCCGCCTCTCACTTAGATGTTGCTGCTGCTGCT 
ACTG 

5480 5490 5500 5510 5520 553 

0 

Hs271in21 GCGCCACTCTTCCTCCGCCCCCCGGGCGCGGGCGGGGCGCAGACCCCCAACGCCAC 
CTCA 

5540 5550 5560 5570 5580 559 

0 



18 



Namnlos 

CC During sequence assembly data is compared from overlapping clone 
s . 

CC Where differences are found these are annotated as variations 
CC together with a note of the overlapping clone name. Note that th 
e 

CC variation annotation may not be found in the sequence submission 

CC corresponding to the overlapping clone, as we submit sequences 

CC with only a small overlap as described above. 

CC This sequence is the entire insert of clone 271M21. 

CC This sequence has been finished according to sequence map criter 

ia as 

CC follows. An attempt is made to resolve all sequencing problems, 
such 

CC as compressions and repeats, but not necessarily within known 
CC annotated human repeat sequence elements (e.g. Alu) . Where the 
CC sequence is ambiguous, there is an annotation using the "unsure" 
CC feature key. 

CC This sequence was generated from part of bacterial clone contigs 
of 

CC human chromosome 6, constructed in collaboration by the Sanger C 
entre 

CC chromosome 6 mapping group and Armin Volz & Andreas Ziegler. Fur 
ther 

CC information can be found at http: //www . Sanger . ac . uk/HGP/Chr 6/ 
CC 271M21 is from the library RPCIl constructed at the Roswell Park 



SCORES Initl: 22943 Initn: 22943 Opt: 22943 z-score: 10976,2 E 

0 : 0 

99.9% identity in 4594 bp overlap 

10 20 

30 

Pagapr2 . Dna ATGTTGCTGCTGCTGCTACTGGCGCC 
ACTC 



Hs271m21 GTCCAGTCCTCATGGCCGCCTCTCACTTAGATGTTGCTGCTGCTGCTACTGGCGCC 
ACTC 

5490 5500 5510 5520 5530 55 

40 

40 50 60 70 ■ 80 

90 

Pagapr2 . Dna TTCCTCCGCCCCCCGGGCGCGGGCGGGGCGCAGACCCCCAACGCCACCTCAGAAGG 
TGCA 

2 



MM 

Hs271m21 
TGCA 

00 



150 

Pagapr2 . Dna 
CTCA 



Hs271m21 
CTCA 

60 



210 

Pagapr2 . Dna 
CCTT 



MM 

Hs271m21 
CCTT 

20 



270 

Pagapr2 . Dna 
CAGC 

I M I 

Hs271m21 
CAGC 

80 



330 

Pagapr2 . Dna 
GCCT 

MM 



Namnlos 

M M M ! I M M M M I M I M M M M I M M M M M I M M I M M M I M I 



TTCCTCCGCCCCCCGGGCGCGGGCGGGGCGCAGACCCCCAACGCCACCTCAGAAGG 
5550 5560 5570 5580 5590 56 

100 110 120 130 140 

TCCTTCTTCGACGACCTCCGGCCCTCCTTCGCTCCACTTCCCTTTCCCTGCATCTC 

M M M M M M M M I M M I M I M M M M M I M M M I M M M M M M I 
TCCTTCTTCGACGACCTCCGGCCCTCCTTCGCTCCACTTCCCTTTCCCTGCATCTC 
5610 5620 5630 5640 5650 56 

160 170 180 190 200 

TTTCTGGTCCTCATCACTATCCCATCAGTCCCACATATCATCCCGGTCTGGCAACC 

M M M M M M M M M M M M M M M M M M M M M I M M M M M M I 
TTTCTGGTCCTCATCACTATCCCATCAGTCCCACATATCATCCCGGTCTGGCAACC 
5670 5680 5690 5700 5710 57 

220 230 240 250 260 

CTGCTCGGCCCGACTTTACTACTGCTGACCTCCTTCTGTCACCCCACGTTACTATC 
M M M M M M M M M M M I M M M M M M M M M M M M M M M M I 
CTGCTCGGCCCGACTTTACTACTGCTGACCTCCTTCTGTCACCCCACGTTACTATC 
5730 5740 5750 5760 5770 57 



280 



290 



300 



310 



320 



ACCTCTTTTCTCTGCCCACATTGCTACACTATACCACCTTCCTGTGCATTTTCTCC 
M M M M M M M M I M M M M M M M M M I M M M M M M I M M M I 

3 



Namnlos 

Hs 2 7 lm2 1 ACCTCTTTTCTCTGCCCACATTGCTACACTATACCACCTTCCTGTGCATTTTCTCC 
GCCT 

5790 5800 5810 5820 5830 58 

40 

340 350 360 370 380 

390 

Pagapr2 . Dna CAATCCCCTTTCCCAGCCCCACATTACTACCTCAATTACTCCCTTTTCTTGGTCCC 
ACTT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M i I M I I I i I I 

I I I I 

Hs 2 7 lm2 1 CAATCCCCTTTCCCAGCCCCACATTACTACCTCAATTACTCCCTTTTCTTGGTCCC 
ACTT 

5850 5860 5870 5880 5890 59 

00 



400 410 420 430 440 

450 

Pagapr2 . Dna TGCTGTCCAGATGATCTTATTAGCCTCCCTTTATCCTCCTATCCTAATTCAACTGG 
AATA 

M I I I I I I I I I I I I M I I I I I I I I I I I I i I I I I M I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs 2 7 lm2 1 TGCTGTCCAGATGATCTTATTAGCCTCCCTTTATCCTCCTATCCTAATTCAACTCG 
AATA 

5910 5920 5930 5940 5950 59 

60 

460 470 480 490 500 

510 

Pagapr2 . Dna TCCTCATTTAGCCTTTTTTTTTAAAGAAAAGCTCCACCCACATATCATACCCTTCA 
TGAT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I 

MM 

Hs271m21 TCCTCATTTAGCCTTTTTTTTTAAAGAAAAGCTCCACCCACATATCATACCCTTCA 
TGAT 



20 



5970 5980 5990 6000 6010 60 



520 530 540 550 560 

570 

Pagapr2 . Dna TTCTTAATTACTTTTCTTTCTTACCTCCACCCAGCACCCTTCCCTCCCCACTTGTG 
GGTT 

I M I M M I M I M M I M M M I M M M I M I M i M M M M M M M M I M 

MM 

Hs2 71m21 TTCTTAATTACTTTTCTTTCTTACCTCCACCCAGCACCCTTCCCTCCCCACTTGTG 
GGTT 

6030 6040 6050 6060 6070 60 



Namnlos 

80 

580 590 600 610 620 

630 

Pagapr2 . Dna CTCTCATCAGCTTTAA.CCCTGGCCCTTTACTCTCTGTCCTTTAGCCAGGGGATCTG 
TACC 

I I I I 



I I I I I I I I M I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 



Hs271m21 CTCTCATCAGCTTTAACCCTGGCCCTTTACTCTCTGTCCTTTAGCCAGGGGATCTG 
TACC 

6090 6100 6110 6120 6130 61 

40 

640 650 660 670 680 

690 

Pagapr2 . Dna TGTCCCCACTCCCACCCTCTAGTGCCCCATCCCTCTTCCTCTGTCCCCAGCCTGCC 
CACA 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I 

MM 

Hs 2 7 lm2 1 TGTCCCCACTCCCACCCTCTAGTGCCCCATCCCTCTTCCTCTGTCCCCAGCCTGCC 
CACA 

6150 6160 6170 6180 6190 62 

00 

700 710 720 730 740 

7 50 

Pagapr2 .Dna GACCACGCCCTACTCTCCCCTTCCTCCCACTGGGGAGCCTGCCTTTTCCTCTTTCC 
CACC 

M M M M M I M M M M M M M I M M M M M M M I M M M M M M M I 

MM 

Hs271m21 GACCACGCCCTACTCTCCCCTTCCTCCCACTGGGGAGCCTGCCTTTTCCTCTTTCC 
CACC 

6210 6220 6230 6240 6250 62 

60 

760 770 780 790 800 

810 

Pagapr2 . Dna ATTCCTCTCTGTATGCCTCCCCGACTCACCCCTTAGGTTGCCAGATCATACACCCG 
CCCT 

M M M M I I M M M M I M M M I M M M M M M M I M M M M M M M I 

MM 

Hs271m21 ATTCCTCTCTGTATGCCTCCCCGACTCACCCCTTAGGTTGCCAGATCATACACCCG 
CCCT 

6270 6280 6290 6300 6310 63 

20 

820 830 840 850 860 

5 



Namnlos 

870 

Pagapr2 . Dna GGGAAGGGGGCATCAGGTACCGGGGCCTGACTCGGGACCAGGTGAAGGCTATCAAC 
TTCC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 GGGAAGGGGGCATCAGGTACCGGGGCCTGACTCGGGACCAGGTGAAGGCTATCAAC 
TTCC 

6330 6340 6350 6360 6370 63 

80 

880 890 900 910 920 

930 

Pagapr2 . Dna TGCCAGTGGACTATGAGATTGAGTATGTGTGCCGGGGGGAGCGCGAGGTGGTGGGG 
CCCA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Mil 

Hs2 71m21 TGCCAGTGGACTATGAGATTGAGTATGTGTGCCGGGGGGAGCGCGAGGTGGTGGGG 
CCCA 

. 6390 6400 6410 6420 6430 64 

40 

940 950 960 970 980 

990 

Pagapr2 . Dna AGGTCCGCAAGTGCCTGGCCAACGGCTCCTGGACAGATATGGACACACCCAGCCGC 
TGTG 

II I I I I I II I I I I I I I II I I I I I I I i I I I II I I I I I I I I I I I I I I II I I I I I I I I I 

I I I I 

Hs 2 7 lm2 1 AGGTCCGCAAGTGCCTGGCCAACGGCTCCTGGACAGATATGGACACACCCAGCCGC 
TGTG 

6450 6460 6470 6480 6490 65 

nn 



1000 1010 1020 1030 1040 

1050 

Pagapr2 . Dna GTGAGTAGCCTCGGAAGCCCCTCCCCTCTTCAAGACTATTCCTTTTCCTGCCGCAA 
ACTT 

I I I I I I I M I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I M I I I 

MM 

Hs 2 7 lm2 1 GTGAGTAGCCTCGGAAGCCCCTCCCCTCTTCAAGACTATTCCTTTTCCTGCCGCAA 
ACTT 

6510 6520 6530 6540 6550 65 

60 

1060 1070 1080 1090 1100 

1110 

Pagapr2 . Dna AGCATTACTGCTTGCAAGTCAGCACTTTAAATCCAGTATACCAAAATTCACAAATA 

6 



CATT 



Namnlos 

I I I I I I I I Ml I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 



I I I I 

Hs271m21 AGCATTACTGCTTGCAA.GTCAGCACTTTAAATCCAGTATACCAAAATTCACAAATA 
CATT 

20 



6570 6580 6590 6600 6610 66 



1120 1130 1140 1150 1160 

1170 

Pagapr2 . Dna TATTGAATGACTACTACATAAGAGCAATTTTGCTCTGTGCGGTTGGAGGTAGTAGA 
GCTA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I i I I I I I I I I I I I I I I I I I I 

I I I I 

Hs2 71m21 TATTGAATGACTACTACATAAGAGCAATTTTGCTCTGTGCGGTTGGAGGTAGTAGA 
GCTA 

6630 6640 6650 6660 6670 66 

80 

1180 1190 1200 1210 1220 

1230 

Pagapr2 . Dna GCAGCCTGCACAGTTCATTTCATCCTCCCTTCATTAGGCCACTGATCATTGGCCTA 
TAAC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Hs271m21 GCAGCCTGCACAGTTCATTTCATCCTCCCTTCATTAGGCCACTGATCATTGGCCTA 
TAAC 

5690 6700 6710 6720 6730 67 

40 

1240 1250 1260 1270 1280 

1290 

Pagapr2 . Dna ATTGATAATTCATCTTGTCAGTTATTCTCTTTGAGGATCATTAGTGGCAGATGATG 
ACAA 

I I M I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I i I I i I I M I I 

I I I I 

Hs271m21 ATTGATAATTCATCTTGTCAGTTATTCTCTTTGAGGATCATTAGTGGCAGATGATG 
ACAA 

00 



6750 6760 6770 6780 6790 68 



1300 1310 1320 1330 1340 

1350 

Pagapr2 .Dna AAAAATTCTAAAATGATTTCATCACATTTTTGAATACCTCTGTCACCAACCCAGAG 
ACCA 

I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I 

I I I I 



Namnlos 

Hs271m21 AAAAATTCTAAAATGATTTCATCACATTTTTGAA.TACCTCTGTCACCAACCCAGAG 
ACCA 

6810 6820 6830 6840 6850 68 

60 

1360 1370 1380 1390 1400 

1410 

Pagapr2 . Dna TATGCCCAAGAAACAAAAGCCAGTTTAATATTAATAGAAGCCAACTATAATAAGAA 
AAGC 

I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 TATGCCCAAGAAACAAAAGCCAGTTTAATATTAATAGAAGCCAACTATAATAAGAA 
AAGC 

6870 6880 6890 6900 6910 69 

20 

1420 1430 1440 1450 1460 

1470 

Pagapr2 . Dna AAATCTGATTGTGCATCCAAAGTTATATACATCTACATATTTCAAAGCCAGAGAAC 

CGCC 

I I I I I I I I I I M I I I I I I I I I I M .I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I 

I I I I 

Hs 2 7 lm2 1 AAATCTGATTGTGCATCCAAAGTTATATACATCTACATATTTCAAAGCCAGAGAA.C 
CGCC 

6930 6940 6950 6960 6970 69 

80 



1480 1490 1500 1510 1520 

1530 

Pagapr2 . Dna CACTGTAGCTGACTTTGAAGAGATCCCATTTTGTGTGCTTATAGCCCCATCTTGGG 
TTCC 

^ ^ ^ ^ I I I M I M I I I M I I M I I I I I I I I I I I I I I I M I I I I I M I I I I I I I n I I I I I I 

Hs2 71m21 CACTGTAGCTGACTTTGAAGAGATCCCATTTTGTGTGCTTATAGCCCCATCTTGGG 
TTCC 

6990 7000 7010 7020 7030 70 

4 0 



1590 



1540 1550 1560 1570 1580 



Pagapr2 . Dna TAAAATGGTAATTTTTTTTTTCTTTTGGGAATGTGTGGATGCTTGCACAGGTAAGG 
GAGG 

I ^ I I II I I I I I M I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I 

Hs 2 7 lm2 1 TAAAATGGTAATTTTTTTTTTCTTTTGGGAATGTGTGGATGCTTGCACAGGTAAGG 
GAGG 



8 



Namnlos 

7050 7060 7070 7080 7090 71 



00 



1600 1610 1620 1630 1640 

1650 

Pagapr2 . Dna ATTGGAAGATAGGTAGGCAAATCCTTTTCACATGTGATTTTCTTTAGAGCAGGATG 
CTTG 

I i I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs2 71m21 ATTGGAAGATAGGTAGGCAAATCCTTTTCACATGTGATTTTCTTTAGAGCAGGATG 
CTTG 

7110 7120 7130 7140 7150 71 

60 

1660 1670 1680 1690 1700 

1710 

Pagapr2 .Dna TGGACCCAAACCTGCACCTGAGTCCCCTGCTCTTTAAAGGGAAAGAGCCTTCTTCA 
ACTC 

i I I I 



I I I I I I M I I I I I I I I I I I I I I I I I I I i I I I I I I I M I I I M I I I I I I I I I I I I 



Hs2 71m21 TGGACCCAAACCTGCACCTGAGTCCCCTGCTCTTTAAAGGGAAAGAGCCTTCTTCA 
ACTC 

7170 7180 7190 7200 7210 72 

2 0 

1720 1730 1740 1750 1760 

1770 

Pagapr2 . Dna GCCTCTCTTCTTATTTTCCTATCTCTCCACAGTCCGAATCTGCTCCAAGTCTTATT 
TGAC 

I I i I I I I I I i I I I I I M I M I I I I I I I I I I I I i I I I I M I I I I i I I I I M I I I M I 

I I I I 

Hs2 71m21 GCCTCTCTTCTTATTTTCCTATCTCTCCACAGTCCGAATCTGCTCCAAGTCTTATT 
TGAC 

80 



7230 7240 7250 7260 7270 72 



1780 1790 1800 1810 1820 

1830 

Pagapr2 . Dna CCTGGAAAATGGGAAGGTTTTCCTGACGGGTGGGGACCTCCCAGCTCTGGACGGAG 
CCCG 

I I I I I I I I I I I I 11 I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I 

I I I I 

Hs2 71m21 CCTGGAAAATGGGAAGGTTTTCCTGACGGGTGGGGACCTCCCAGCTCTGGACGGAG 
CCCG 

7290 7300 7310 7320 7330 . 73 

40 



9 



1890 

Pagapr2 . Dna 
GTAG 



Hs271m21 
GTAG 



00 



1950 

Pagapr2 . Dna 
TGCA 

I I I I 

Hs271m21 
TGCA 

60 



2010 

Pagapr2 . Dna 
GAAA 



Hs271m21 
GAAA 

20 



Namnlos 

1840 1850 1860 1870 1880 

GGTGGATTTCCGGTGTGACCCCGACTTCCATCTGGTGGGCAGCTCCCGGAGCATCT 
I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I 1 I I I I I I M I I I I I I M I I I I I I I I 
GGTGGATTTCCGGTGTGACCCCGACTTCCATCTGGTGGGCAGCTCCCGGAGCATCT 
7350 7360 7370 7380 7390 14 



1900 



1910 



1920 



1930 



1940 



TCAGGGCCAGTGGAGCACCCCCAAGCCCCACTGCCAGGGTGAGGGGAACAGCTGCC 

I I I I I I I II I I I I II II I II I I I I I I II I I I I II I II II I I I I 1.1 II I II I I I I II 
TCAGGGCCAGTGGAGCACCCCCAAGCCCCACTGCCAGGGTGAGGGGAACAGCTGCC 
7410 7420 7430 7440 7450 74 

1960 1970 1980 1990 2000 

TGCAGCTGATGAGGACGCTTGTGTGAGGATGGGAGTGGGGTGGGAATGGATAATGG 
I II I II I I I M II M I I I I II M I I I I M I M I II I I I I I I I I I I I II I II I II II 
TGCAGCTGATGAGGACGCTTGTGTGAGGATGGGAGTGGGGTGGGAATGGATAATGG 
7470 7480 7490 7500 7510 75 



2020 2030 2040 2050 2060 

2070 

Pagapr2 . Dna GAATGGAGAGCTATAAAAATGTGGGGGAGGACACTGGAAAGGGGAGATGAAAGTCC 
CTTT 

I I I i I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
GAATGGAGAGCTATAAAAATGTGGGGGAGGACACTGGAAAGGGGAGATGAAAGTCC 
7530 7540 7550 7560 7570 75 



MM 

Hs271m21 
CTTT 

80 



2080 2090 2100 2110 2120 

2130 

Pagapr2 . Dna TTCCTCCATCACCTGCCTCAAACTTCCTCTTGCAGTCCCCGGTATCCTCTGTAGGT 

10 



TGGG 



Namnlos 

I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I 



I I I I 

Hs 2 7 lm2 1 TTCCTCCATCACCTGCCTCAAACTTCCTCTTGCAGTCCCCGGTATCCTCTGTAGGT 
TGGG 



7590 7600 7610 7620 7630 76 



40 



2140 2150 2160 2170 2180 

2190 

Pagapr2 . Dna GGCTTCCTTCCTTTACCTTTTAAAAAAATCTTCCTGCTCCCGATTCTTAGACCTCA 
CGTT 

I I I I I M I I I I I I M I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I M I I I i M 

I I I I 

Hs271m21 GGCTTCCTTCCTTTACCTTTTAAAAAAATCTTCCTGCTCCCGATTCTTAGACCTCA 
CGTT 

00 



7650 7660 7670 7680 7690 77 



2200 2210 2220 2230 2240 

2250 

Pagapr2 . Dna TTCTCTTTTCCTTTATGAATCTCACCTCTCTCACCTTCTTCAGGTTTAAATACTCC 
AATT 

I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I II I I M I I I I I I 

I I II 

Hs2 7 lm2 1 TTCTCTTTTCCTTTATGAATCTCACCTCTCTCACCTTCTTCAGGTTTAAATACTCC 
AATT 

7710 7720 7730 7740 7750 77 

60 

2260 2270 2280 2290 2300 

2310 

Pagapr2 . Dna TTCCCTTTCTCTAAACTTAGAAATTTCCATGCATCACCCTCTTCTAGAATTCATCC 
CTCA 

I I II I I I II I M M I II I II M I II I II M I I I I I I I II I I M I I M I II I I I I M 

1 1 1 1 

Hs271m21 TTCCCTTTCTCTAAACTTAGAAATTTCCATGCATCACCCTCTf CTAGAATTCATCC 

CTCA 

20 



7770 7780 7790 7800 7810 78 



2320 2330 2340 2350 2360 

2370 

Pagapr2 . Dna CCATTCCTTATATAATTGATTTATTGTAAAGACTCAGAAATAAATCAAACATTCTA 
CTAA 

I M I I I II I I I I I I I I I I II I I II I I I I I II I M I I I I I M M I II M I I M I I I I 

11 



Namnlos 

MM 

Hs271m21 C CAT T C C T T AT AT AAT T GAT T TAT T GT AAAGAC T C AGAAAT AAAT C AAAC AT T C T A 

CTAA 

80 



7830 7840 7850 7860 7870 78 



2380 2390 2400 2410 2420 

2430 

Pagapr2 . Dna GAAAAATTGAGAAGGGGAGCTCTGGGGGTGGAAACATATTAGGGTAAAAGACTTAA 
AATT 

I M I M M M M M M M I M M M I M M M M M M I M I M I I M M M M M 

MM 

Hs271in21 GAAAAATTGAGAAGGGGAGCTCTGGGGGTGGAAACATATTAGGGTAAAAGACTTAA 
AATT 

7890 . 7900 7910 7920 7930 79 

40 

2440 2450 2460 2470 2480 

2 4 90 

Pagapr2 . Dna GGAGGCAGCATTATCAGAAGATGAAGAACAACTCAGGGATGGGGTGGGAAGAAGAC 
AGGT 

^ ^ ^ ^ M M M M M M M M M M M M M M M M M M M M M M M I M M M M I 

Hs 2 7 lin2 1 GGAGGCAGCATTATCAGAAGATGAAGAACAACTCAGGGATGGGGTGGGAAGAAGAC 
AGGT 

7950 7960 7970 7980 7990 80 



00 



2550 



2500 2510 2520 2530 .2540 



Pagapr2 . Dna CCTTTTCTGKACTTCCTAGACAACCTCCATTATTCCCTAAGGGAATCAGTGTTGTG 
TCTG 

^ ^ ^ I I I I M M M : I M M M M M M M M M I M M I M M I M M M M l-M M M I 

Hs271m21 CCTTTTCTGTACTTCCTAGACAACCTCCATTATTCCCTAAGGGAATCAGTGTTGTG 
TCTG 

8010 8020 8030 8040 8050 80 

60 



2610 



2560 2570 2580 2590 2600 



Pagapr2 . Dna TCTACYTTTTTTTTTTTTTTTTTGCCACGTAATTTTACAAACTCTCCCTTTTCTAG 
GCAC 

I ^ ^ I N M MM M M M M M M M M M M I M M I M M I M I M M I M I M I I M 

Hs2 71m21 TCTACTTTTTTTTTTTTTTTTTTGCCACGTAATTTTACAAACTCTCCCTTTTCTAG 
GCAC 

12 



8070 



Namnlos 
8080 8090 



20 



8100 



8110 



81 



2620 2630 2640 2650 2660 

2670 

Pagapr2 . Dna CCGAACTCTCTGCCATCTTCTCTCCTGGGATGCAGTCATCCCATTTGTATGCCTCA 
TACT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I M I I I I I I I I M I I I I I I I I I 
CCGAACTCTCTGCCATCTTCTCTCCTGGGATGCAGTCATCCCATTTGTATGCCTCA 



I I I I 

Hs271m21 
TACT 



8130 



8140 



80 

2730 

Pagapr2 . Dna 
TCAG 

I I I I 

Hs271m21 
TCAG 

40 

2790 



8150 



8160 



8170 



81 



2680 2690 2700 2710 2720 

TCCTCTACCCTGGTAGATTCTTTCAAGATCCTTGGGCTTTACTTTCCTCACATAAC 
I I I I I I I I I M I II I I i I I I M I I I I I I I I I I M M I I I I I I I I I I I I M I I I I I I 
TCCTCTACCCTGGTAGATTCTTTCAAGATCCTTGGGCTTTACTTTCCTCACATAAC 
8190 8200 8210 8220 8230 82 



2740 



2750 



2760 



2770 



2780 



Pagapr2 . Dna TTATTCTGCTTCTAGTTTACCATTTTATTCTGGAAATTGAGAGTCCCATCCAGGGG 

TGGA 

< I I M M I I I I I I I I I I I I I I M I I I II I II II M M I I I I II I I I M I II I I II I 



I I I I 

Hs271ni21 
TGGA 

00 



2850 

Pagapr2 . Dna 
CTGT 

MM 

Hs271m21 
CTGT 

60 



TTATTCTGCTTCTAGTTTACCATTTTATTCTGGAAATTGAGAGTCCCAT-CCAGGGG 
8250 8260 8270 0280 8290 83 

2800 2810 2820 2830 2840 

CTTATGACACTACTGAAACTTAGACTTCAAGGTTCCTCACCTACAGGGCCCTCTTC 

< < M I I M II I II II I I II II I I II M I II I I II I II II I II II M I II I 

CTTATGACACTACTGAAACTTAGACTTCAAGGTTCCTCACCTACAGGGCCCTCTTC 
8310 8320 8330 8340 8350 83 



13 



Namnlos 

2860 2870 2880 2890 2900 

2910 

Pagapr2 . Dna GCTCTAATAATATAGAGGGCTCGATGGATATGTGTTCATATGGTAACAGGCTTTTG 
TAAA 

M I I I I I I i I I I I I I I I I I I I I M I I I I I I i I I I I I I M I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m2 1 GCTCTAATAATATAGAGGGCTCGATGGATATGTGTTCATATGGTAACAGGCTTTTG 
TAAA 

8370 8380 8390 8400 8410 84 

20 

2920 2930 2940 2950 2960 

2970 

Pagapr2 . Dna AATTGCAGAAATAAGATTTTAACAGCAATTGCTTAAAGCCAATTGTATGTGTAATT 
TTTT 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I M I I I 

I I I I 

Hs271m21 AATTGCAGAAATAAGATTTTAACAGCAATTGCTTAAA.GCCAATTGTATGTGTAATT 
TTTT 

8430 8440 8450 8460 8470 84 



80 



3030 



2980 2990 3000 3010 3020 



Pagapr2 . Dna TTCTTAAAGACTCCCAATTTTGTAATATTCAGGCACCACAGAACCAAGATCTGCCC 
CAAA 

I I I I I M I I I I I I I I I I M I I I I I I I M I I I I M I I I I I I I I I I I I M I I I I M I I 

I I I I 

Hs2 71m21 TTCTTAPAGACTCCCAATTTTGTAATATTCAGGCACCACAGAACCAAGATCTGCCC 
CAAA 

8490 8500 8510 8520 8530 85 

40 



3090 



3040 3050 3060 3070 3080 



Pagapr2 . Dna CTTAGCTATTGGCATTCCCGTCTCAAATTCTGTTGTCCTATGAAAAATCGAAGAAG 
AAAA 

I ^ I I I M I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I i I I I I I I I I I I I I I I I I I I i 

Hs2 71m21 CTTAGCTATTGGCATTCCCGTCTCAAATTCTGTTGTCCTATGAAAAATCGAAGAAG 
AAAA 

8550 8560 8570 8580 8590 86 

00 



3150 



3100 3110 3120 3130 3140 

14 



Namnlos 

Pagapr2 , Dna TAAGTCCTGACCCCCTTACCCCCAGACCCACCTTGTTCTTATCCCCAGGCACCCTC 
CCCT 

I I I I I I I I I I I I I M I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs 2 7 lm2 1 TAAGTCCTGACCCCCTTACCCCCAGACCCACCTTGTTCTTATCCCCAGGCACCCTC 
CCCT 

60 



8610 8620 8630 8640 8650 86 



3160 3170 3180 3190 3200 

3210 

Pagapr2 . Dna CAGAAACGCAGGCTTCTGCTCTCCCCGGTCTTCAGCATGGACAGGTGTGGGAGGGG 
GCTG 

I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

1 1 1 1 

Hs271in21 CAGAAACGCAGGCTTCTGCTCTCCCCGGTCTTCAGCATGGACAGGTGTGGGAGGGG 
GCTG 



8670 8680 8690 8700 8710 87 



20 



3220 3230 3240 3250 3260 

3270 

Pagapr2 . Dna GGGATCAGGCCAGGGAAGCTGGGCGCCAGTGGTAACTCTTCTCTGATCCCCGTCTT 
TCCT 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 GGGATCAGGCCAGGGAAGCTGGGCGCCAGTGGTAACTCTTCTCTGATCCCCGTCTT 
TCCT 

8730 8740 8750 8760 8770 87 

8 0 

3280 3290 3300 3310 3320 

3330 

Pagapr2 . Dna GCTGCCAGTGAATCGAACGCCACACTCAGGTGAGATGAGAAA.CCCTTACCGCGCGC 
ACTG 

I I I I I I I I I I I M i I I M I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I 

MM 

Hs271m21 GCTGCCAGTGAATCGAACGCCACACTCAGGTGAGATGAGAAACCCTTACCGCGCGC 
ACTG 

8790 8800 8810 8820 8830 88 

40 

3340 3350 3360 3370 3380 

3390 

Pagapr2 . Dna CAATGCCCTCCCCTTCACTCTGCACCCTCCACCCCCCTGAAATTCTGCCCTTAGGC 
TACG 

M M M M M M M M M M M M M M I M M M M I I I M I M M M M M M I 

15 



Namnlos 

I I I I 

Hs271m21 CAA.TGCCCTCCCCTTCACTCTGCACCCTCCACCCCCCTGAAATTCTGCCCTTAGGC 
TACG 

00 



8850 8860 8870 8880 8890 89 



3400 3410 3420 3430 3440 

3450 

Pagapr2 . Dna GGGCGTCGTCCTTTCGCACCTTCCCCAACCCACCCCAGTTTGCGGCCACCCCCTTC 
CCTC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I M I I I I I I I I I M I I j I I I I 

I I I I 

Hs271in21 GGGCGTCGTCCTTTCGCACCTTCCCCAACCCACCCCAGTTTGCGGCCACCCCCTTC 
CCTC 

60 



8910 8920 8930. 8940 8950 89 



3460 3470 3480 3490 3500 

3510 

Pagapr2 . Dna CCTACCTGTTTCCTGCCTCCAGTCCCGGTTTTCCACGAGGCTGCGGTCTCTCCTTG 
TCCC 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

I I I I 

Hs271m21 CCTACCTGTTTCCTGCCTCCAGTCCCGGTTTTCCACGAGGCTGCGGTCTCTCCTTG 
TCCC 

8970 8980 8990 9000 9010 90 

20 



3520 3530 3540 3550 3560 



!570 



Pagapr2 . Dna TGCTTGGCTACACTTCCCTGGGCTCCACCTCCTCCCAGACTGAGCCTCGCCGGTGT 
CAGG 

I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I 11 I I I M I I I 

MM 

Hs271m21 TGCTTGGCTACACTTCCCTGGGCTCCACCTCCTCCCAGACTGAGCCTCGCCGGTGT 
CAGG 

80 



9030 9040 9050 9060 9070 90 



3580 3590 3600 3610 3620 

3630 

Pagapr2 . Dna CAGAGCCCAGCAGARGGCGGCAGGGTGCTGGGAGACCCTGAGCTCCCACCACGTTT 
TCCC 

I M I M M I M M M M M M M M M M M I M M M I M I I M M M M M M I 

MM 

Hs271m21 CAGAGCCCAGCAGAGGGCGGCAGGGTGCTGGGAGACCCTGAGCTCCCACCACGTTT 
TCCC 

16 



Namnlos 

9090 9100 9110 9120 9130 91 



40 



3640 3650 3660 3670 3630 

3690 

Pagapr2 . Dna CTGTGGGGTTCCTTGCGACCTTCGCTGGAACCTTTTCCAGCCTGCTGCCTCCTAGG 
ATTT 

I I I I I I I I I I I I I I I I I I I I I I M I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MM 

Hs2 71m21 CTGTGGGGTTCCTTGCGACCTTCGCTGGAACCTTTTCCAGCCTGCTGCCTCCTAGG 
ATTT 

9150 9160 9170 9180 9190 92 

00 

3700 3710 3720 3730 3740 

3750 

Pagapr2 . Dna CACCTAATGGACTTTCTCAGCCTGTCCCACCCATCCCAACCCTGGCCAGGCCTCTC 
GCGC 

I M M M M M M M M I M M M M M I M M M M M M M M M M M M M I 

MM 

Hs2 71m21 CACCTAATGGACTTTCTCAGCCTG.TCCCACCCATCCCAACCCTGGCCAGGCCTCTC 
GCGC 



9210 9220 9230 9240 9250 92 



60 



3760 3770 3780 3790 3800 

3810 

Pagapr2 . Dna TCTTCCCCACATCTTTTCCTTCCGTGTACCCCTTCCCTCGTCTTTTCTCAATTCCA 
TGTC 

I M I M M M M M M M M M M M M M M M M M M M M M M M I M M I 

I M I 

Hs271m21 TCTTCCCCACATCTTTTCCTTCCGTGTACCCCTTCCCTCGTCTTTTCTC-AATTCCA 
TGTC 

9270 9280 9290 9300 9310 93 

20 • 

3820 3830 3840 3850 3360 

3870 

Pagapr2 . Dna CTGTCTCCCTTTCTTAGGCTTCTGTCTACCCAGCCCCAGGCTCCCTTCCACGACCC 
CACC 

MM 



M M M M I M M M M M I M M M M M M I M M M M I M M M M M M 



Hs271m21 CTGTCTCCCTTTCTTAGGCTTCTGTCTACCCAGCCCCAGGCTCCCTTCCACGACCC 
CACC 

9330 9340 9350 9360 9370 93 

80 

17 



Namnlos 

3880 3890 3900 3910 3920 

3930 

Pagapr2 . Dna ACTCCCTCAAACCAGCCTCCCTTCCGTACCCAACTCGTTCCCTCCAAAACCGTTTC 
CTCT 

I I I I I I I I I I I I I I I I I I I I I j M I I I I I I I I I I I I I I I I M I M I I I i I I M I I I 

I I I I 

Hs 2 7 lm2 1 ACTCCCTCAAACCAGCCTCCCTTCCGTACCCAACTCGTTCCCTCCAAAACCGTTTC 
CTCT 

40 



9390 9400 9410 9420 9430 94 



3940 3950 3960 3970 3980 

3990 

Pagapr2 . Dna CCCCCACATCCTCAGTGCTTCACTGTATCGACTCATACTCCCACTTCAGACCTCAG 
GCGC 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I M I I I I I I I M I I I I I I I I 

INI 

Hs271m21 CCCCCACATCCTCAGTGCTTCACTGTATCGACTCATACTCCCACTTCAGACCTCAG 
GCGC 

9450 9460 9470 9480 9490 95 

00 

4000 4010 4020 4030 4040 

4 050 

Pagapr2 . Dna CAGCCCCGTTTCTCTCCCGTCCCACTCGCATCCTTCCCTTCCTACCCTGGTTCCTC 
CGTG 

I I i I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

I I 1 I 

Hs2 7 lni2 1 CAGCCCCGTTTCTCTCCCGTCCCACTCGCATCCTTCCCTTCCTACCCTGGTTCCTC 
CGTG 

9510 9520 9530 9540 9550 95 

60 



4110 



4060 4070 4080 4090 4100 



Pagapr2 . Dna CTTCAGCCTCCCGCGGCTCCCTCCGCCCACCCCGCCCTCCTGGCACGCCCCGTCCC 
CATT 

M I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

MM 

Hs271m21 CTTCAGCCTCCCGCGGCTCCCTCCGCCCACCCCGCCCTCCTGGCACGCCCCGTCCC 
CATT 

9570 9580 9590 9600 9610 96 

2 0 

4120 4130 4140 4150 4160 

4170 

18 



Namnlos 

Pagapr2 . Dna TCTCCTCCCCTCGGGTCCCCTTAA.GTGAGATCCCTCCCTTCCTCTTTCGTTCCTTT 
CCTC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MM 

Hs271m21 TCTCCTCCCCTCGGGTCCCCTTAAGTGAGATCCCTCCCTTCCTCTTTCGTTCCTTT 
CCTC 

9630 9640 9650 9660 9670 96 

80 



4180 4190 4200 4210 4220 

4230 

Pagapr2 . Dna CTCGAGGTTGCATCCCCCCTCCCCTCCCCGCCCCTCCGACTGTCGCTCCCACCTCG 
GCGC 

M M M M M M M M M M M M M I M M I M M M I M M M M M M M M I 

MM 

Hs2 71m21 CTCGAGGTTGCATCCCCCCTCCCCTCCCCGCCCCTCCGACTGTCGCTCCCACCTCG 
GCGC 

40 



9690 9700 9710 9720 9730 97 



4240 4250 4260 4270 4280 

4290 

Pagapr2 . Dna TCGCTTCCCTCCCCGCCCCCTTCCTGCCTCCCCAGCTCCCGCCCGCCCCCCCACCC 
CCCG 

I I M M M M M M M M M I M M M M M M M M M I M M M M M M M M 

MM 

Hs2 71ra21 TCGCTTCCCTCCCCGCCCCCTTCCTGCCTCCCCAGCTCCCGCCCGCCCCCCCACCC 
CCCG 



9750 9760 9770 9780 9790 98 



00 



4300 4310 4320 4330 4340 

4350 

Pagapr2 . Dna CTGCCGCGCGCCGCCCGTGACGTCAGAGCCCCCTCCCAGCCCCACATCTCCCTCCT 
GCTC 

M M M I M I M I M M M M M M I M M M M I M M M M M M M M M M I 

I M I 

Hs271m21 CTGCCGCGCGCCGCCCGTGACGTCAGAGCCCCCTCCCAGCCCCACATCTCCCTCCT 
GCTC 

60 



9810 9820 9830 9840 9850 98 



4360 .4370 4380 4390 4400 

4410 

Pagapr2 . Dna CTCCTCCTCCCCTCCGTCGGTCAGTCAGTCCGCGAGGAGAGTCCGCGGTGGCGGCG 
ACGG 

M M M M M M I M M M M M M M M M M M M M M M M M M M M I M 

19 



Namnlos 



Hs2 71m21 CTCCTCCTCCCCTCCGTCGGTCAGTCAGTCCGCGAGGAGAGTCCGCGGTGGCGGCG 
ACGG 

9870 9880 9890 9900 9910 99 

20 

4420 4430 4440 4450 4460 

4470 

Pagapr2 . Dna TGGCGAGAGCCGCGGGGGCCGTAGGAAGCCAACCTTCCCTGCTTCTCCGGGGCCCT 
CGCC 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M 

I I I I 

Hs271m21 TGGCGAGAGCCGCGGGGGCCGTAGGAAGCCAACCTTCCCTGCTTCTCCGGGGCCCT 
CGCC 

RO 



9930 9940 9950 9960 9970 99 



4480 4490 4500 4510 4520 

4530 

Pagapr2 . Dna CCCTCCTCCCCACAAAATCAGGGATGGAGGCGCCTCCCCGGCACCCTCTTAGCAGC 
CCTC 

I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M 

I I I I 

Hs271m21 CCCTCCTCCCCACAAAATCAGGGATGGAGGCGCCTCCCCGGCACCCTCTTAGCAGC 
CCTC 

40 



9990 10000 10010 10020 10030 100 



4540 4550 4560 4570 4580 

4590 

Pagapr2 . Dna CCCGGGAAAAGTGTCCCCCCTGAGCTCCTAACGCTCCCCAACAGCTACCCCTGCCC 
CCCA 

Ml I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Hs271m21 CCCAGGAAAAGTGTCCCCCCTGAGCTCCTAACGCTCCCCAACAGCTACCCCTGCCC 
CCCA 

10050 10050 10070 10080 10090 101 

00 



Pagapr2.Dna CGCC 
I I I I 

Hs271in21 CGCCATGGGGCCCGGGGCCCCTTTTGCCCGGGTGGGGTGGCCACTGCCGCTTCTGG 

20 



Namnlos 

TTGT 

10110 10120 10130 10140 10150 101 

60 



21 



